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Abstract 


Theory  resolution  constitutes  a  set  of  complete  procedures  for  incorporating  theories  into  a 
resolution  theorem-proving  program,  thereby  making  it  unnecessary  to  resolve  directly  upon 
axioms  of  the  theory.  This  can  greatly  reduce  the  length  of  proofs  and  the  size  of  the  search 
space.  Theory  resolution  effects  a  beneficial  division  of  labor,  improving  the  performance 
of  the  theorem  prover  and  increasing  the  applicability  of  the  specialized  reasoning  proce¬ 
dures.  Total  theory  resolution  utilizes  a  decision  procedure  that  is  capable  of  determining 
unsatisfiability  of  any  set  of  clauses  using  predicates  in  the  theory.  Partial  theory  resolution 
employs  a  weaker  decision  procedure  that  can  determine  potential  unsatisfiability  of  sets 
of  literals.  Applications  include  the  building  in  of  both  mathematical  and  special  decision 
procedures,  e.g.,  for  the  taxonomic  information  furnished  by  a  knowledge  representation 
system.  Theory  resolution  is  a  generalization  of  numerous  previously  known  resolution  re¬ 
finements.  Its  power  is  demonstrated  by  comparing  solutions  of  “Schubert’s  Steamroller” 
challenge  problem  with  and  without  building  in  axioms  through  theory  resolution. 
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1  Introduction 


Incorporating  a  theory  into  derived  inference  rules  so  that  its  axioms  are  never  resolved  upon 
has  enormous  potential  for  reducing  the  size  of  the  exponential  search  space  commonly  en¬ 
countered  in  resolution  theorem  proving  [11,17,5,38],  Theory  resolution  is  a  method  of 
incorporating  specialized  reasoning  procedures  in  a  resolution  theorem  prover  so  that  the 
reasoning  task  will  be  effectively  divided  into  two  parts:  special  cases,  such  as  reasoning 
about  inequalities  or  about  taxonomic  information,  are  handled  efficiently  by  specialized 
reasoning  procedures,  while  more  general  reasoning  is  handled  by  resolution.  The  connec¬ 
tion  between  the  two  reasoning  components  is  made  by  having  the  resolution  procedure 
resolve  on  sets  of  literals  whose  conjunction  is  determined  to  be  unsatisfiable  by  the  special¬ 
ized  reasoning  procedure,  rather  than  by  just  using  ordinary  unification  of  complementary 
literals.  The  objective  of  research  on  theory  resolution  is  the  conceptual  design  of  deduction 
systems  that  combine  deductive  specialists  within  the  common  framework  of  a  resolution 
theorem  prover. 

We  are  incorporating  our  work  on  theory  resolution  in  the  development  of  a  deduction 
system  [31]  for  use  in  the  KLAUS  natural-language-understanding  system  [14].  The  use  of 
theory  resolution  to  help  incorporate  theories  of  taxonomies,  orderings,  etc.,  should  be  very 
beneficial,  given  the  pervasive  need  for  these  forms  of  reasoning  in  carrying  out  the  task  of 
understanding  language  and  the  real  world. 

Concern  has  often  been  expressed  about  the  ineffectiveness  of  applying  resolution  the¬ 
orem  proving'to  problems  in  artificial  intelligence.  Theory  resolution  is  designed  to  partly 
address  this  concern  by  providing  a  means  for  incorporating  specialized  reasoning  proce¬ 
dures  in  a  resolution  theorem  prover.  The  division  of  labor  achieved  in  the  reasoning  process 
by  theory  resolution  is  intended  to  produce  the  dual  advantages  of  improving  the  theorem 
prover’s  performance  by  the  use  of  more  efficient  reasoning  procedures  for  special  cases  and 
of  increasing  the  range  of  application  of  the  specialized  reasoning  procedures  by  including 
them  in  a  more  general  reasoning  system. 

Past  criticisms  of  resolution  can  often  be  characterized  by  their  pejorative  use  of  the 
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terms  uniform  and  syntactic.  Theory  resolution  meets  these  objections  head-on.  In  theory 
resolution,  a  specialized  reasoning  procedure  may  be  substituted  for  ordinary  syntactic 
unification  to  determine  unsatisfiability  of  sets  of  literals.  Because  the  implementation  of 
this  specialized  reasoning  procedure  is  unspecified — to  the  theorem  prover  it  is  a  “black 
box”  with  prescribed  behavior,  namely,  able  to  determine  unsatisfiability  in  the  theory 
it  implements — the  resulting  system  is  nonuniform  because  reasoning  within  the  theory 
is  performed  by  the  specialized  reasoning  procedure,  while  reasoning  outside  the  theory 
is  performed  by  resolution.  Theory  resolution  can  also  be  regarded  as  being  not  wholly 
syntactic,  since  the  conditions  for  resolving  on  a  set  of  literals  are  no  longer  based  on  their 
being  made  syntactically  identical,  but  rather  on  their  being  unsatisfiable  in  a  theory,  and 
thus  resolvability  is  partly  semantic. 

Besides  being  a  higher-level  inference  rule  than  ordinary  resolution  by  virtue  of  its  use 
of  specialized  reasoning  procedures,  theory  resolution  is  also  a  higher-level  rule  because,  as 
in  hyperresolution,  it  allows  inferences  requiring  more  than  two  parent  clauses. 

Theory  resolution  can  be  seen  partly  as  an  extension  of  the  work  on  building  in  equa- 
tional  theories  (i.e.,  theories  that  can  he  expressed  as  a  set  of  equalities)  [26].  This  consisted 
of  using  special  unification  algorithms  and  reducing  terms  to  normal  form.  This  work  has 
been  extended  substantially,  particularly  in  the  area  of  development  of  special  unification 
algorithms  for  various  equational  theories  [29]. 

Not  all  theories  that  it  would  be  useful  to  incorporate  are  equational.  For  example, 
reasoning  about  orderings  and  other  transitive  relations  is  often  necessary,  but  using  or¬ 
dinary  resolution  for  this  is  quite  inefficient.  It  is  possible  to  derive  an  infinite  number  of 
consequences  from  a  <  b  and  (z  <  y)  A  (y  <  r)  D  (z  <  z)  despite  the  obvious  fact  that 
a  refutation  based  on  just  these  two  formulas  is  impossible.  A  solution  to  this  problem  is 
to  require  that  use  of  the  transitivity  axiom  be  restricted  to  occasions  when  either  there 
are  matches  for  two  of  its  literals  (partial  theory  resolution)  or  a  complete  refutation  of  the 
ordering  part  of  the  clauses  can  be  found  (total  theory  resolution). 

An  important  form  of  reasoning  in  artificial  intelligence  applications  embodied  in  knowl- 
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edge  representation  systems  [10]  is  reasoning  about  taxonomic  information  and  property 
inheritance.  One  of  our  goals  is  to  be  able  to  take  advantage  of  the  efficient  reasoning 
provided  by  a  knowledge  representation  system  by  using  it  as  a  taxonomy  decision  proce¬ 
dure  in  a  larger  deduction  system.  This  makes  sense  because  it  relieves  the  general-purpose 
deduction  system  of  the  need  to  perform  taxonomic  reasoning  and  because  it  extends  the 
power  of  the  knowledge  representation  system  towards  greater  logical  completeness.  Other 
researchers  have  also  cited  the  advantages  of  integrating  knowledge  representation  systems 
with  more  general  deduction  systems  [9,27].  KRYPTON  [8,25]  represents  an  approach  to 
constructing  a  knowledge  representation  system  composed  of  two  parts:  a  terminological 
component  (the  TBox)  and  an  assertional  component  (the  ABox).  For  such  systems,  theory 
resolution  indicates  in  general  how  information  can  be  provided  to  the  ABox  by  the  TBox 
and  how  it  can  be  used  by  the  ABox. 

2  Theory  Resolution 

We  will  now  define  the  theory  resolution  operation  and,  in  succeeding  sections,  discuss 
various  useful  restrictions  on  theory  resolution.  We  will  limit  our  discussion  to  the  variable- 
free  “ground*  case  of  theory  resolution,  since  lifting  to  the  general  case  is  straightforward. 
In  the  general  case,  it  is  required  only  that  all  ground  theory  resolvents  of  instances  of  the 
formulas  be  represented  as  instances  of  general  theory  resolvents. 

We  will  assume  the  standard  definitions  of  a  term,  an  atomic  formula  (atom),  and  a 
literal.  In  forming  literals  from  atomic  formulas,  the  symbol  -i  will  be  used  to  represent 
negation.  It  is  also  used  to  represent  the  operator  that  forms  a  literal  of  the  opposite 
polarity,  e.g.,  if  L  is  the  literal  ->A  then  -i L  denotes  A. 

We  will  consider  a  clause  to  be  a  disjunction  of  n  >  0  literals.  If  n  =  0,  the  clause  is 
the  empty  clause  □.  If  n  =  1,  the  clause  is  a  unf/  clause.  The  disjunction  connective  V  is 
assumed  to  be  associative  and  commutative,  i.e.,  ordering  of  the  literals  in  a  disjunction  is 
immaterial;  V  is  also  assumed  to  be  idempotent,  i.e.,  forming  the  disjunction  of  two  clauses 
which  contain  literals  in  common  will  result  in  only  one  occurrence  of  each  such  literal 
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appearing  in  the  resulting  disjunction.  The  empty  clause  □  is  the  identity  element  for  V. 
We  will  generally  make  no  distinction  between  a  unit  clause  and  the  single  literal  of  which 
it  is  composed. 

We  will  also  use  the  standard  A  (conjunction),  D  (implication),  and  s  (equivalence) 
connectives  in  examples. 

We  will  assume  the  standard  definitions  of  an  interpretation,  an  interpretation  satisfying 
or  falsifying  a  formula  or  set  of  formulas,  and  a  formula  or  set  of  formulas  being  satisfiable 
or  unsatisfiable. 

Any  satisfiable  set  of  formulas  that  we  wish  to  incorporate  into  the  inference  process  can 
be  regarded  as  a  theory.  This  definition  of  a  theory  being  a  set  of  formulas  is  used  in  defining 
T-unsatisfiability,  etc.,  but  should  not  be  taken  too  seriously  for  implementation  purposes. 
An  objective  of  theory  resolution  is  the  incorporation  of  theories  into  the  inference  process 
so  that  it  will  be  unnecessary  to  resolve  directly  upon  the  axioms  of  the  incorporated  theory. 
Moreover,  theory  resolution  does  not  require  any  direct  representation  of  the  formulas  of 
the  theory  in  the  definition  of  the  inference  operations  either.  For  example,  the  decision 
procedure  that  theory  resolution  requires  can  be  a  computer  program,  and  the  formulas  that 
comprise  the  theory  could  only  be  ascertained  by  examining  the  structure  and  behavior  of 
the  program. 

Definition  1  A  T 'interpretation  is  an  interpretation  that  satisfies  [the  formulas  of]  theory 

T. 

For  example,  in  a  theory  of  partial  ordering  ORD  consisting  of  -<(x  <  x)  and  ( x  < 
y)  A  (y  <  r)  D  (j  <  z),  the  predicate  <  cannot  be  interpreted  so  that  a  <  a  has  value  true 
or  a  <  e  has  value  false  if  a  <  b  and  b  <  e  both  have  value  true.  In  a  taxonomic  theory  TAX 
including  Boy[x)  D  Person(x),  Boy(John)  cannot  have  value  true  while  Person(John)  has 
value  false. 

Definition  2  A  set  of  clauses  S  is  T-unsatisfiable  iff  no  T-interpretation  satisfies  S.  S  is 
minimally  T 'Unsatisfiable  iff  S,  but  no  proper  subset  of  S,  is  T-unsatisfiable. 
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Definition  3  Let  C\,. . .  ,Cm  (m  >  1)  be  a  set  of  nonempty  clauses,  let  each  C,-  be  de¬ 
composed  as  Ki  V  Li  where  if,-  is  a  nonempty  clause,  and  let  Pi, . . . ,  Rn  (n  >  0)  be  unit 
clauses.  Suppose  the  set  of  clauses  K\, . . .  ,Km,  R\,  •  ■  -  ,Rn  is  T-unsatisfiable.  Then  the 
clause  Li  V  •  •  •  V  Lm  V  -i R\  V  •  •  ■  V  -iRn  is  a  theory  resolvent  using  theory  T  (T -resolvent) 
of  Ci, . . . ,  Cm.  The  theory  resolvent  is  called  an  m-ary  theory  resolvent  (unary  iff  m  =  1, 
61'nary  iff  m  =  2).  It  is  a  total  theory  resolvent  iff  n  =  0;  otherwise  it  is  partial.  Km 

is  called  the  key  of  the  theory  resolution  operation.  For  partial  theory  resolvents,  R\ , . . . , 
is  a  set  of  conditions  for  the  T-unsatisfiability  of  the  key.  The  negation  -iRi  V  *  ■  ■  V  -iR„  of 
the  conjunction  of  the  conditions  is  called  the  residue  of  the  theory  resolution  operation.  It 
is  a  narrow  theory  resolvent  iff  each  K,  is  a  unit  clause;  otherwise  it  is  wide. 

The  definition  above  classifies  individual  theory  resolution  operations  according  to  whether 
they  are  total  or  partial  and  wide  or  narrow.  Theory  resolution  procedures  are  classified 
according  to  the  operations  allowed:  partial  theory  resolution  permits  total  as  well  as  par¬ 
tial  theory  resolution  operations,  while  total  theory  resolution  permits  only  total  theory 
resolution  operations;  wide  theory  resolution  permits  narrow  as  well  as  wide  theory  res¬ 
olution  operations,  while  narrow  theory  resolution  permits  only  narrow  theory  resolution 
operations.  Thus  partial  theory  resolution  includes  total  theory  resolution  and  wide  theory 
resolution  includes  narrow  theory  resolution. 

Example  4  A  set  of  unit  clauses  is  unsatisfiable  in  the  theory  of  partial  ordering  ORD  iff 
it  contains  a.chain  of  inequalities  1 1  <  •••  <  tn[n  >  2)  such  that  either  tj  is  the  same  as 
tn  or  — i(i i  <  tn)  is  also  one  of  the  clauses.  P  is  a  unary  total  narrow  OPD-resolvent  of 
(a  <  a)  VP.  P  V  Q  is  a  binary  total  narrow  OPD-resolvent  of  (a  <  b)  V  P  and  (6  <  a)  V  Q. 
PS/Q  V  RV  S  is  a  4-ary  total  narrow  ORD-  resol  vent  of  (a  <  6)  VP,  (6  <  c)VQ,  (c  <  d)  V  R, 
and  -i(a  <  d)  V  S.  This  can  also  be  derived  incrementally  through  partial  narrow  ORD- 
resolution,  i.e.,  by  resolving  (a  <  b)  V  P  and  (6  <  c)  V  Q  to  obtain  (a  <  c)  V  PV  Q,  resolving 
that  with  (c  <  d)  V  R  to  obtain  (a  <  d)  V  P  V  Q  V  R,  and  resolving  that  with  -i(a  <  d)  V  S 
to  obtain  P  V  Q  V  R  V  S . 
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Example  5  Suppose  the  taxonomic  theory  TAX  includes  a  definition  for  fatherhood 
Father(x)  =  [Man(x )  A  3yChild(x,y)\.  Then  Father(Fred)  is  a  partial  wide  theory 
resolvent  of  Child{Fred,  Pat)  V  Child(Fred,  Sandy)  and  Man(Fred).  Also,  □  is  a  to¬ 
tal  wide  theory  resolvent  of  Child(Fred,  Pat)  V  Child(Fred,  Sandy),  Man(Fred),  and 
-iFather[Fred). 

Thus,  the  type  of  reasoning  that  is  employable  in  the  decision  procedure  can  be  quite 
different  from  and  more  effective  in  its  domain  than  resolution. 

Note  that  the  definition  of  a  theory  resolvent  includes  ordinary  resolvents  since  the 
unsatisfiability  of  pairs  of  complementary  literals  implies  their  T-unsatisfiability. 

The  following  proves  the  soundness  of  theory  resolution. 

Theorem  6  Let  T  be  a  theory,  S  a  set  of  clauses,  and  C  a  T -resolvent  of  S.  Then  every 
T -interpretation  I  that  satisfies  S  also  satisfies  C. 

Proof:  Let  I  be  any  T-interpretation  that  satisfies  S.  Consider  any  decomposition  of 
clauses  in  S  into  Ki  and  L,-  leading  to  T-resolvent  C  being  L\  V  •  •  ■  V  Lm  V  -iRj  V  •  *  •  V-i Rn. 
Suppose  I  satisfies  some  L,\  Then  I  also  satisfies  C,  Alternatively,  suppose  /  falsifies  every 

Li-  Because  I  satisfies  S,  I  must  satisfy  every  Ki.  But  since  K\ . Km,  Ri, . . . ,  Rn  is 

T-unsatisfiable,  I  must  falsify  at  least  one  Rj.  Thus,  I  must  satisfy  at  least  one  -> Rj  and 
consequently  C.  I 

Definition-3  states  what  can  be  inferred  by  theory  resolution  and  Theorem  6  confirms 
that  the  inferences  are  &U  valid.  However,  not  all  instances  of  theory  resolution  satisfying 
the  definition  need  actually  be  inferred  for  completeness.  This  is  important  because  the 
definition  of  theory  resolution  is  too  general  to  be  usefully  applied  directly.  For  example,  if 
C\, ... ,  Cm  is  any  T-unsatisfiable  set  of  clauses  then  □  is  always  derivable  from  Cj , . . . ,  Cm 
in  a  single  total  wide  theory  resolution  step  with  each  C{  decomposed  into  Ki  =  Q  and 
Li  —  □.  This  will  be  necessary  for  some  sets  of  clauses,  but  to  require  it  always  would 
make  the  definition  of  theory  resolution  essentially  useless. 
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In  the  following  sections,  we  will  explore  some  possible  restrictions  on  the  definition  of 
theory  resolution  that  make  it  practical  to  apply  while  preserving  completeness.  The  first 
restriction  we  will  consider  is  an  instance  of  total  wide  theory  resolution.  In  this  form  of 
total  wide  theory  resolution,  clauses  are  divided  into  two  subclauses — one  using  predicates 
in  the  theory,  the  other  not.  If  for  some  set  of  clauses  the  conjunction  of  the  theory 
subclauses  (the  key)  is  7’*unsatisfiable,  then  the  disjunction  of  the  nontheory  subclauses 
can  be  derived  by  theory  resolution.  This  is  a  powerful  and  general  procedure,  but  has  the 
disadvantage  that  deciding  T -unsatisfiability  of  sets  of  clauses  may  be  too  difficult.  Certainly 
it  is  infeasible  for  some  specialized  reasoning  procedures,  such  as  taxonomic  reasoning  in 
knowledge  representation  systems,  that  are  incapable  of  dealing  with  disjunction. 

More  generally,  it  is  often  convenient  to  be  able  to  use  a  simpler  decision  procedure 
than  one  capable  of  deciding  7’-unsatisfiabiiity  of  sets  of  clauses.  Narrow  theory  resolution 
is  proposed  for  such  cases.  In  narrow  theory  resolution,  only  T-unsatisfiability  of  set9  of 
literals,  not  clauses,  must  be  decided.  Total  and  partial  narrow  theory  resolution  are  both 
possible.  In  total  narrow  theory  resolution,  the  resolved-upon  literals  (the  key)  must  be 
T-unsatisfiable.  In  partial  narrow  theory  resolution,  the  key  must  be  T-unsatisfiable  only 
under  some  conditions.  The  negated  conditions  are  used  as  the  residue  in  the  formation 
of  the  resolvent.  Partial  narrow  theory  resolution  generalizes  results  presented  in  [32]  by 
allowing  resolution  with  any  number  of  key  literals,  i.e.,  by  not  being  limited  to  binary 
theory  resolution. 

2.1  Total  Wide  Theory  Resolution 

Suppose  we  have  a  decision  procedure  for  T  that  iB  capable  of  finding  all  minimally  T- 
unsatisfiable  subsets  of  any  set  of  clauses  containing  only  predicates  in  T.  This  procedure 
could  then  be  applied  to  S  with  all  literals  having  predicates  not  in  T  removed.  For  each 
minimally  T'-unsatisfiable  subset  discovered,  a  total  wide  theory  resolvent,  which  contains 
no  occurrences  of  predicates  in  T,  can  be  derived. 

This  is  a  particular  case  of  wide  theory  resolution  in  which  the  literals  that  are  resolved 
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upon  by  means  of  theory  resolution  are  determined  by  their  predicate  symbols.  Following 
is  a  completeness  proof  for  this  form  of  wide  theory  resolution. 

Lemma  7  Let  S  be  an  unsatisfiable  set  of  clauses  and  P  a  set  of  predicates.  Let  S  be 
decomposed  into  S  =  Sp  U  Sp  so  that  every  clause  in  Sp  has  an  occurrence  of  a  predicate 
in  P  and  no  clause  in  Sp  has  an  occurrence  of  a  predicate  in  P.  Each  clause  CV  in  Sp  is 
of  the  form  Ki\/  Li,  where  all  predicates  in  K{  are  in  P  and  all  predicates  in  L,  are  not  in 
P.  Each  K{  is  nonempty,  but  Li  might  be  empty.  Let  X  be  the  set  of  all  clauses  such  that 
Cit , . . . ,  C,m  are  clauses  in  Sp,  Ki , ,  Klm  is  a  minimally  unsatisfiable  set  of  clauses,  and 
the  clause  in  X  is  of  the  form  L,-,  V  •  •  •  V  Lim.  Then  Sp  U  X  is  unsatisfiable. 

Proof:  Because  S  i’b  unsatisfiable,  and  by  virtue  of  the  completeness  of  A-ordered 
resolution,1  there  exists  an  A-ordered  resolution  refutation  of  S  with  predicates  in  P  pre¬ 
ceding  predicates  not  in  P  in  the  A-ordering.  The  refutation  contains  a  possibly  empty  Bet 
X'  of  derived  clauses  that  contain  no  predicates  in  P  but  whose  parents  contain  predicates 
in  P.  By  removing  the  derivations  of  clauBeB  in  X1  from  the  refutation  of  S,  we  obtain 
a  refutation  of  Sp  U  X'.  Thus  Sp  U  X1  is  unsatisfiable.  When  we  look  at  the  A-ordered 
derivation,  it  is  apparent  that  each  clause  in  X1  must  be  of  the  form  L,-,  V  •  •  •  V  L,-m  and 
that  the  corresponding  set  K,x ,...,  K{m  must  be  unsatisfiable.  If  K{x , . . . ,  iT,m  is  minimally 
unsatisfiable,  then  the  clause  in  X1  also  belongs  to  X.  If  if;,, . .. ,  K{m  is  not  minimally  un- 
satiBfiable,  then  the  clause  in  X'  is  still  subsumed  by  (possibly  identical  to)  an  element  of  X. 
Because  X  contains  each  element  of  X'  or  an  element  that  subsumes  it,  the  unsatisfiability 
of  Sp  U  X  follows  from  the  unsatisfiability  of  Sp  U  X'.  I 

Theorem  8  Let  S  be  a  T -unsatisfiable  set  of  clauses.  Then  there  is  a  refutation  of  S 
(derivation  of  □  from  S)  using  total  wide  theory  resolution  with  theory  T. 

Proof:  Consider  representing  the  theory  T  as  a  set  of  clauses  also  called  T.  Because 
S  is  T-unsatisfiable,  S  U  T  is  unsatisfiable.  Let  P  be  the  set  of  predicates  occurring  in  T. 

'A-ordered  resolution  [11,17)  is  a  refinement  of  resolution  that  permits  resolution  only  on  the  literals  of 
each  clause  whose  atoms  appear  earliest  in  a  fixed  ordering  (the  A-ordering). 
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By  the  lemma,  (5  U  T)p  U  X,  which  equals  Sp  U  X  since  Tp  =  0,  is  unsatisfiable.  Each 
clause  in  X  is  of  form  L,*,  V  •  •  •  V  L,m  and  depends  on  the  unsatisfiability  of  a  set  of  clauses 
Some  of  these  clauses  K{j  may  be  in  T.  Then  the  remaining  clauses  K{k  are 
T-unsatisfiable;  the  clause  in  X  is  a  total  T-resolvent  of  the  clauses  Cti  and  is  thus  derivable 
by  total  wide  theory  resolution  with  theory  T.  Because  Sp  U  X  is  unsatisfiable,  it  can  be 
refuted  by  ordinary  resolution,  a  special  case  of  total  wide  theory  resolution.  Thus  S  has  a 
refutation  using  total  wide  theory  resolution  with  theory  T.  I 

There  are  limitations  to  the  use  of  total  theory  resolution.  The  requirement  that  the 
decision  procedure  for  the  theory  be  capable  of  determining  unsatisfiability  of  any  set  of 
clauses  using  predicates  in  the  theory  is  quite  strict.  Reasoning  about  sets  of  clauses  is 
probably  an  excessive  requirement  for  such  purposes  as  using  a  knowledge  representation 
system  as  a  decision  procedure  for  taxonomic  information,  since  such  systems  are  often 
weak  in  handling  disjunction.  This  tends  to  limit  total  resolution’s  applicability  to  build¬ 
ing  in  mathematical  decision  procedures  that  handle  disjunction.  For  example,  a  decision 
procedure  for  Presburger  arithmetic  (integer  addition  and  inequality)  might  be  adapted  to 
meet  the  requirements  for  total  theory  resolution. 

Some  care  must  be  taken  in  deciding  what  theory  T  to  incorporate.  The  theory  must  be 
capable  of  deciding  sets  of  clauses  that  are  constructed  by  using  any  predicates  appearing 
in  T.  Thus,  if  we  try  to  use  total  theory  resolution  to  build  in  the  equality  relation  with 
equality  substitutivity  (i.e.,  i  =  y3  (P(*  ••*•••)  D  P[ •  •  ■  y  ■  •  •))  for  each  predicate  P),  the 
decision  procedure  will  have  to  decide  all  of  S. 

There  may  be  a  large  number  of  T-unsatisfiable  keys  that  do  not  result  in  useful  T- 
resolvents.  It  would  be  a  worthwhile  refinement  to  monitor  the  finding  of  T-unsatisfiable 
sets  of  clauses  to  verify  that  the  substitutions  made  do  not  preclude  future  use  of  the 
T-resolvent.  This  is  like  applying  a  purity  check  in  A-ordered  resolution. 
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2.2  Narrow  Theory  Resolution 

Narrow  theory  resolution  is  a  form  of  theory  resolution  that  requires  a  less  complex  decision 
procedure  than  does  wide  theory  resolution.  Unlike  the  latter,  which  must  consider  T- 
unsatisfiability  of  sets  of  clauses,  narrow  theory  resolution  considers  T-unsatisfiability  of 
sets  of  literals.  Thus,  the  requirement  for  the  decision  procedure  for  T  to  handle  disjunction 
is  eliminated. 

Two  principal  variations  of  narrow  theory  resolution  are  considered  here:  total  narrow 
theory  resolution  and  partial  narrow  theory  resolution. 

In  total  narrow  theory  resolution,  if  C\ . Cm  is  a  set  of  clauses,  each  C,-  is  decomposed 

as  K{  V  L{,  where  K{  is  a  single  literal,  and  is  minimally  T-unsatisfiable,  then 

L\  V  •  •  •  V  Lm  is  a  T-resolvent  of  C j, . . . ,  Cm.  The  procedure  remains  sound  if  K\, . . . ,  Km 
is  nonminimally  T-unsatisfiable.  However,  completeness  generally  requires  the  absence  of 
extraneous  literals. 

Partial  narrow  theory  resolution  is  more  general  and  includes  total  narrow  theory  reso¬ 
lution  as  a  special  case. 

In  partial  narrow  theory  resolution,  if  is  a  set  of  clauses,  each  C,-  is  de¬ 

composed  as  Ki  V  Li  where  Ki  is  a  single  literal,  Ri,...,Rn  is  another  set  of  literals,  and 
Kit. Km,  Ri,. .. ,  Rn  is  minimally  T-unsatisfiable,  then  Li  V  •  •  •  V  Lm  V  ->Ri  V  -  •  •  V-i  Rn  is 

a  T-resolvent  of  Cj, . . .,  Cm.  Kx , . . . ,  Km  is  called  the  key  set  of  literals  or  key.  R\ . Rn 

are  conditions  for  the  T-unsatisfiability  of  K\, . . . ,  Kmi  i.e.,  if  n  >  0,  then  Ki, .. . ,  Km  is  not 
T-unsatisfiable,  but  K\, ,  Km  conjoined  with  R\,...,Rn  is.  -i/Ej,  . . . ,  ->Rn  is  called  the 
residue  set  of  literals  or  residue.  The  procedure  remains  sound  if  K\y  ■  •  • ,  Kmi  Ri  t  •  •  •  >  Rn  is 
nonminimally  T-unsatisfiable.  However,  completeness  again  generally  requires  the  absence 
of  extraneous  literals. 

Sometimes  it  is  onerous  to  have  to  decide  T-unsatisfiability  of  arbitrarily  large  sets 
of  literals,  as  total  narrow  theory  resolution  requires.  In  these  situations,  partial  narrow 
theory  resolution  can  be  used.  The  decision  procedure  for  T  must  recognize  the  potential 
T-unsatisfiability  of  a  set  of  literals,  i.e.,  the  set  of  literals  is  T-unsatisfiable  under  some 
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conditions.  The  partial  narrow  theory  resolvent  will  include  (as  the  residue)  the  negation 
of  the  conditions  for  the  T-unsatisfiability  of  the  set  of  literals  resolved  on  (the  key)  in  the 
partial  narrow  theory  resolution  operation.  The  residue  must  be  removed  by  later  resolution 
operations  for  the  resolvent  to  be  used  in  a  refutation.  Partial  narrow  theory  resolution 
thus  finds  T-unsatisfiable  sets  of  literals  incrementally  by  resolving  on  key  sets  of  literals, 
adding  residue  sets  of  literals  in  the  resolvent,  and  then  resolving  these  away  in  further 
resolution  operations. 

Since  partial  narrow  theory  resolution  will  often  examine  sets  of  clauses  that  are  smaller 
than  those  used  for  total  narrow  theory  resolution,  it  may  be  less  expensive  to  compute 
partial  narrow  theory  resolvents  than  total  narrow  theory  resolvents.  Offsetting  this  advan¬ 
tage  is  the  risk  that  more  of  the  resolvents  produced  will  be  useless  because  their  residues 
cannot  be  resolved  away. 

We  do  not  want  to  require  the  derivation  of  all  partial  narrow  theory  resolvents  permit¬ 
ted  by  the  definition.  Since  partial  narrow  theory  resolution  includes  total  narrow  theory 
resolution,  this  would  require  the  derivation  of  all  total  narrow  theory  resolvents,  thereby 
defeating  the  purpose  of  partial  theory  resolution. 

It  would  also  require  the  derivation  of  obviously  unnecessary  resolvents.  For  example, 
we  could  resolve  (a  <  b)  V  P  and  (c  <  d)  V  R,  since,  under  some  conditions  such  as 
(6  <  c)  A  (d  <  o),  a  <  b  and  e  <  d  are  T-unsatisfiable.  But  it  would  be  silly  to  draw  such 
inferences.  If  we  permit  inferences  from  a  <  b  and  e  <  d,  which  have  no  terms  in  common, 
the  theory  resolution  procedure  would  not  be  very  useful.  If  resolving  a  <  b  and  c  <  d  were 
to  actually  lead  to  a  refutation — i.e.,  conditions  for  their  T-unsatisfiability  do  hold — then 
some  of  these  conditions,  e.g.,  (b  <  c)  A  (d  <  a),  must  have  arguments  in  common  with 
a  <  b  and  c  <  d.  We  should  restrict  partial  theory  resolution  to  cases  in  which  the  literals 
are  suitably  related. 

To  justify  such  pragmatically  necessary  restrictions  on  theory  resolution,  we  offer  the 
following  criterion  for  the  selection  of  key  sets  of  literals  that  provides  a,  sufficient  condition 
for  the  completeness  of  partial  narrow  theory  resolution. 
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In  essence,  the  key  selection  criterion  requires  that  every  T -unsatisfiable  set  of  literals 
have  one  or  more  subset  key  sets  of  literals  that  can  be  T-resolved.  For  example,  in  theory 
ORD,  in  refuting  sets  of  positive  inequality  literals,  we  might  select  only  pairs  of  literals 
matching  x  <  y  and  y  <  z  as  key  sets  of  literals.  Thus,  in  refuting  the  set  {a  <  b,b  < 
c,c  <  d,d  <  a},  we  would  be  permitted,  for  example,  to  resolve  upon  a  <  b  and  b  <  c, 
but  not  a  <  b  and  c  <  d.  Key  sets  of  literals  have  one  or  more  residues  associated  with 
them  such  that  every  minimally  T-unsatisfiable  set  includes  a  key  with  a  residue  that  can 
be  refuted  by  resolving  away  the  literals  in  the  residue.  With  literals  matching  x  <  y  and 
y  <  z  selected,  it  is  sufficient  to  derive  T-resolvents  with  residue  x  <  z.  For  example,  a  <  b 
and  b  <  c  can  be  T-resolved  with  a  <  c  as  the  result.  This  can  then  be  resolved  with  c  <  d 
to  derive  a  <  d  that  can  be  resolved  with  d  <  a  to  derive  □. 

Key  selection  criterion. 

•  For  any  minimally  T-unsatisfiable  set  of  literals  S,  there  is  at  least  one  key  set  of 
literals  K  such  that  K  C  S.  K  has  at  least  two  literals  (one  literal  if  S  has  only  one 
literal).2  Each  K  is  recognizable  by  the  decision  procedure  for  T  and  will  comprise 
the  key  for  possible  theory  resolution  operations,  if  clauses  containing  the  key  literals 
are  present. 

•  For  any  such  key  set  of  literals  K,  there  is  at  least  one,  possibly  empty,  residue  set 

of  literals  R  such  that  K  U  ->R  is  minimally  T-unsatisfiable,  where  n R  denotes  the 
set  .. .  ,->i£n}  when  R  =  {/?i, . . Rn}.  Each  — is  a  Bet  of  conditions  for  the 

T-unsatisfiability  of  key  set  K.  Each  R  is  computed  from  K  by  the  decision  procedure 
for  T  and  is  used  as  a  residue  for  theory  resolution  operations  that  resolve  on  key  K. 

•  It  must  be  the  case  that,  for  some  key  set  of  literals  K  and  associated  residue  set  of 

literals  R,  (S  —  K)  U  {VR}  is  minimally  T-unsatisfiable,  where  VR  denotes  the  clause 
5The  requirement  that  K  have  at  least  two  literals  unless  S  has  only  one  literal  conveniently  eliminates 
the  possihie  derivation  of  unary  theory  resolvents  with  nonempty  residue.  The  use  of  such  valid  but 
unnecessary  resolvents  in  a  derivation  can  be  supplanted  by  the  use  of  higher-arity  theory  resolution 
operations. 
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R\  V  •••  V  Rn  when  R  =  {Ri, . . . ,  Rn).  This  ensures  that  key  selection  and  residue 
computation  will  be  sufficient  for  completeness — any  T-unsatisfiable  set  of  literals  S 
has  a  T-resolvent  using  a  key  K  C  S  and  residue  R  computed  from  K  such  that  the 
T-resolvent  is  contradicted  by  the  remaining  literals  S  —  K. 

In  total  narrow  theory  resolution,  we  uniformly  take  the  key  K  to  be  the  entire  minimally 
T-unsatisfiable  set  of  literals  S.  The  residue  R  is  always  empty. 

In  partial  narrow  theory  resolution,  we  will  try  to  minimize  the  number  of  residue 
sets  of  literals.  Thus,  for  K  =  {a  <  6,6  <  c}  we  might  have  residues  R\  =  {a  <  c}, 
R<i  =  {-i(e  <  a:i), -1(2:1  <  a) } ,  Rs  =  {->(c  <  21), -1(2:1  <  2;2),-i(z2  <  a)},  etc.  However,  only 
Ri  need  be  used,  since,  in  the  theory  T,  R\  implies  every  other  Ri.  R\  can  be  regarded  as 
the  strongest  consequence  of  a  <  b  and  b  <  c  in  theory  T. 

There  may  still  be  more  than  one  partial  theory  resolution  operation  with  the  same  key 
but  with  different  residues,  because  the  key  can  be  extended  to  a  T-unsatisfiable  set  in  more 
than  one  way.  For  example,  given 

triangU(at  b,  c) 
triangle(d,  e,f) 
length(a)  =  length(d) 
length(b)  =  length(e) 
langth(c)  =  lcngth(f) 

we  could  infer,  using  a  theory  of  geometry,  each  of 

angle(a,  b)  =  angle(d,  e) 
angle(a,c)  =  angle(d,  f) 
angle{b,e)  =  angle(e,f) 

as  residue. 

The  following  theorem  proves  the  completeness  of  narrow  theory  resolution  with  arbi¬ 
trary  selection  of  key  sets  of  literals  satisfying  the  key  selection  criterion.  The  completeness 
of  total  narrow  theory  resolution  follows  as  an  immediate  corollary  because  total  narrow 
theory  resolution  is  just  a  special  case  of  narrow  theory  resolution,  where  the  key  set  of 
literals  is  selected  to  be  the  entire  T-unsatisfiable  set  of  literals  and  the  residue  is  empty. 
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Theorem  9  Let  S  be  a  T -unsatisfiable  set  of  clauses.  Then  there  is  a  refutation  of  S 
(derivation  of  □  from  S )  using  partial  narrow  theory  resolution  with  theory  T  for  arbitrary 
selection  of  key  sets  satisfying  the  key  selection  criterion. 

Proof:  If  □  6  5,  then  S  is  trivially  refuted. 

Otherwise  we  will  prove  the  theorem  by  induction  on  complexity  measure  c(S),  where 
c(£)  =  (|S|,  k(S)),  where  |£|  is  the  number  of  clauses  in  S  and  k(S)  is  the  excess  literal 
parameter  [2].  The  excess  literal  parameter  is  defined  to  be  the  number  of  literals  (i.e., 
literal  occurrences)  in  S  minus  |S|.  The  ordering  of  e(S)  is  defined  by  c(5j)  <  e(S2)  iff 
|Si|  <  l^l,  or  |5, j  =  |S2|  and  fc(Sj)  <  k{S2). 

Case  c(S)  =  (m,0).  Every  clause  must  be  a  unit  clause.  Because  S  is  T-unsatisfiable, 
it  must  include  a  minimally  T-unsatisfiable  subset  S'. 

Subcase  |S'|  <  2,  By  the  key  selection  criterion,  S'  must  be  selected  as  a 
key.  The  empty  clause  □  is  derivable  in  a  single  unary  or  binary  T-resoIution 
step  from  S'  and  hence  from  S. 

Subcase  |£'|  >  2.  By  the  key  selection  criterion,  there  exists  a  key  K  C  S' 
with  | if |  >  2  and  (possibly  empty)  residue  R  such  that  S"  =  (S'  —  if)  U  { V i?} 
is  minimally  T-unsatisfiable.  c(S")  -<  c(S')  <  e(S).  Thus,  by  the  induction 
hypothesis,  □  is  derivable  from  S".  Since  Vif  is  a  T-resolvent  of  if  C  S,  □  is 
derivable  from  S. 

Case  e(£)  =  (m,n),  n  >  0.  Select  a  nonunit  clause  CBS.  Decompose  C  into  unit  clause 
A  and  clause  B,  i.e.,  C  =  AVfl.  Because  S  is  T-unsatisfiable,  both  Sa  =  (S  -  {C})  U  {A} 
and  Sfl  =  (S-  {C})U{fl}  are  T-unsatisfiable.  Both  c(Sx)  -<  c(S)  and  c(Sfl)  •<  c(S).  Thus, 
by  the  induction  hypothesis,  there  must  exist  derivations  of  □  from  each  of  Sa  and  Sb • 

Imitate  the  derivation  of  D  from  Sb,  using  C  instead  of  B.  The  result  will  be  a 
derivation  of  either  □  or  A  from  S.  In  the  latter  case,  extend  the  derivation  of  A  from  S 
to  a  derivation  of  □  from  S  by  appending  the  derivation  of  □  from  Sa-  i 
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Corollary  10  Let  S  be  a  T - unsatisfiable  set  of  clauses.  Then  there  is  a  refutation  of  S 
(derivation  of  O  from  S)  using  total  narrow  theory  resolution  with  theory  T. 

Although  the  theorem  proves  completeness  of  narrow  theory  resolution,  its  proof  does 
not  preclude  the  need  for  tautologies  in  a  refutation.  Indeed,  it  is  the  case  that  tautologies 
may  have  to  be  retained  for  a  refutation  to  be  found. 

Example  11  Let  T  be  the  theory  in  which  P,  Q,  and  R  are  all  equivalent.  Let  S  be 
{P  V  Q  V  R,-iP  V  ~<Q  V  -if?}.  There  is  a  single-step  wide  T-reBolution  refutation  of  S. 
However,  although  there  do  exist  refutations  of  5  by  narrow  T-resolution,®  all  require 
retention  of  tautologies,  since  all  narrow  T-resolvents  of  P  V  Q  V  72  and  ->P  V  -*Q  V  ->R  are 
tautologies. 

The  theorem  justifies  the  sufficiency  of  dealing  with  only  single  pairs  of  literals  at  a  time, 
i.e.,  using  binary  partial  narrow  theory  resolution.  In  this  type  of  theory  resolution,  every 
theory  resolvent  has  two  parents  (or  maybe  one,  with  no  residue),  provided  key  literals  are 
selected  in  conformance  with  the  key  selection  criterion. 

This  is  often  a  very  useful  instance  of  theory  resolution.  It  includes  resolving  on  comple¬ 
mentary  pairs  of  literals  (ordinary  binary  resolution)  and  on  T-unsatisfiable  pairs  of  literals. 
It  also  includes  formation  of  partial  theory  resolvents  for  selected  pairs  of  potentially  T- 
unsatisfiable  pairs  of  literals,  including,  for  example,  the  derivation  of  the  OPD-resolvent 
a  <  c  from  a  <  b  and  b  <  c.  Despite  the  usefulness  of  binary  partial  theory  resolution,  there 
will  be  occasions  when  it  is  better  to  use  nonbinary  partial  theory  resolution  to  result  in  a 
smaller  Bearch  space. 

Finally,  note  that  heuristic  restrictions  of  theory  resolution  (such  as  discarding  all  tau¬ 
tologies,  not  recognizing  all  caseB  of  T-unsatisfiability,  or  not  computing  all  residues),  though 
incomplete,  may  be  very  useful  in  practice. 

aFor  example,  resolve  on  P  in  (1)  P  V  Q  V  R  and  ->P  in  (2)  PV->QV  ->R  to  obtain  (3)  Q  V  R  V-iQV  -iff; 
then  resolve  on  P  in  (t)  and  ~iQ  in  (3)  to  obtain  (4)  QV  RV  -iff;  finally  resolve  on  P  in  (1)  and  ->R  in 
(4)  to  obtain  (5)  QV  R.  There  is  a  similar  derivation  of  (6)  ->Q  V  -»Jf.  Just  as  (5)  and  (6)  were  derived 
from  (1)  and  (2),  the  contradictory  unit  clauses  R  and  ->R  can  be  derived  from  (5)  and  (6). 
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2.3  Theory  Matings 

The  theory  matings  procedure  is  another  method  of  incorporating  theories  that  is  similar  to 
the  total  narrow  theory  resolution  method,  in  the  sense  of  imposing  the  same  requirements 
on  the  decision  procedure  for  T,  i.e.,  determining  T-unsatisfiability  of  sets  of  literals,  but 
that  does  not  depend  on  performing  resolution  inference  operations.  Hence  it  also  overcomes 
the  difficulty  in  total  narrow  theory  resolution  of  retention  of  tautologies. 

The  theory  matings  procedure  is  an  extension  of  Andrews’s  matings  procedure  [3]  (see 
also  Bibel’s  connection  method  [5]). 

Definition  12  Let  {m  >  1)  be  a  set  of  clauses.  Then  each  set  of  literals 

Ki, . . . ,  Km  such  that  each  K;  is  a  literal  of  <7,-  is  a  path  through  Cx, . . . ,  Cm. 

A  path  consists  of  one  literal  from  each  clause;  it  can  also  be  regarded  as  one  row  of  the 
dual,  disjunctive  normal  form  of  the  set  of  clauses. 

The  most  important  theorem  for  the  application  of  the  matings  procedure  is  the  follow¬ 
ing. 

Theorem  IS  Let  S  be  a  set  of  clauses.  Then  S  is  unsatisfiable  iff  every  path  through  S 
contains  a  complementary  pair  of  literals. 

This  can  be  easily  extended  to  the  theory  matings  procedure  that  builds  in  a  theory  T. 

Theorem  14  Let  S  be  a  set  of  clauses.  Then  S  is  T -unsatisfiable  iff  every  path  through  S 
contains  a  T -unsatisfiable  set  of  literals. 

Proof: 

If  part.  Soundneaa.  Assume  that  every  path  through  S  contains  a  T-unsatisfiable  set 
of  literals  and  is  thus  T-unsatisfiable.  For  S  not  to  be  T-unsatisfiable,  there  must  be  a 
T-interpretation  /  that  satisfies  each  clause  in  S.  To  satisfy  each  clause  in  S,  I  must  satisfy 
at  least  one  literal  of  the  clause.  It  must  thus  satisfy  at  least  one  path  through  S.  But  since 
every  path  is  T-unsatisfiable,  there  is  no  such  I.  Hence  S  is  T-unsatisfiable. 
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Only  if  part.  Completeness.  Assume  that  S  is  T-unsatisfiable.  Suppose  some  path 
through  S  does  not  contain  a  T-unsatisfiable  set  of  literals.  Then  that  path  has  a  T- 
interpretation  I  that  satisfies  it.  But  I  would  also  satisfy  S,  since  it  satisfies  a  literal  of 
each  clause  of  S.  This  contradicts  the  assumption  that  S  is  T-unsatisfiable.  Hence  every 
path  through  S  must  contain  a  T-unsatisfiable  set  of  literals.  I 

2.4  Metatheory  Resolution 

We  regard  the  “black  box”  nature  of  the  decision  procedure  being  built  in  by  means  of 
theory  resolution  to  be  an  important  aspect  of  theory  resolution  because  it  allows  flex¬ 
ibility  in  implementation  of  the  decision  procedure  by  not  requiring  it  to  employ  clause 
representation,  resolution,  or  other  arbitrary  restrictions. 

However,  one  interesting  possibility  for  implementation  of  decision  procedures  for  theory 
resolution  entails  using,  in  the  decision  procedure,  a  resolution  theorem  prover  whose  domain 
of  discourse  is  the  language  of  the  outer  theorem  prover. 

We  could  axiomatize  the  T-unsatisfiability  relation  and  use  these  axioms  together  with 
information  on  what  literals  are  present  to  identify,  by  means  of  resolution,  prospective  key 
sets  of  literals  to  resolve  upon.  This  reasoning  system  could  itself  depend  on  additional 
layers  of  theory  resolution  to  establish  its  own  inference  operations. 

This  procedure  of  having  successive  layers  of  deduction  systems  determining  unsatisfia- 
bility  in  successive  theories  in  order  to  apply  resolution  operations  could  be  called  metatheory 
resolution. 

3  Examples  of  Theory  Resolution 

Theory  resolution  is  a  procedure  with  substantial  generality  and  power.  Thus,  it  is  not 
surprising  that  many  specialized  reasoning  procedures  can  be  viewed  as  instances  of  theory 
resolution,  perhaps  with  additional  constraints  governing  which  theory  resolvents  can  be 
inferred.  We  believe  that  the  success  of  these  specialized  reasoning  procedures  helps  to 
validate  the  concept  of  theory  resolution. 
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First  of  all,  we  should  note  that  there  is  a  relationship  between  theory  resolution  and 
hyperresolution.  Although  further  constraints  (e.g.,  on  the  polarity  of  the  literals)  are 
often  prescribed,  the  essence  of  hyperresolution  is  the  derivation  of  L\  V  •  •  •  V  Lm  V  R  from 
the  electron  clauses  if.VL,-,  where  K{  is  a  literal  and  L,-  is  a  [possibly  empty]  clause  and  the 
nucleus  clause  ->Ki  V  ■  •  •  V-> Km  V  R,  where  R  is  a  [possibly  empty]  clause.  This  corresponds 
to  a  theory  resolution  operation  using  theory  T,  where  ->K\  V-  •  -V->KmV  R  is  a  consequence 
of  T,  Km  is  the  key  set  of  literals,  and  R  is  the  residue. 

Theory  resolution  is  also  related  to  procedural  attachment  [23],  whereby  expressions 
are  “evaluated”  to  produce  new  expressions.  Ordinary  procedural  attachment  can  be  re¬ 
garded  as  unary  theory  resolution.  Theory  resolution  in  general  can  be  considered  as  an 
extension  of  the  notion  of  procedural  attachment  to  sets  of  literals.  Where  ordinary  proce¬ 
dural  attachment  permits  the  replacement  of  2  <  3  by  true,  theory  resolution,  in  effect,  can 
attach  a  procedure  to  the  <  relation  that  permits  derivation  of  a  <  c  from  a  <  b  and  b  <  c. 

Two  previous  refinements  of  resolution  that  resemble  partial  theory  resolution  are  Z- 
resolution  and  U-generaiized  resolution. 

Dixon’s  Z-reBolution  [13]  is  essentially  binary  total  narrow  theory  resolution  with  the 
restriction  that  T  must  consist  of  a  finite  deductively  closed  set  of  2-clauses  (clauses  with 
length  2).  This  restriction  does  not  permit  inclusion  of  assertions  like  V  <5(/(i)), 

-1(1  <  x),  or  (a:  <  y)  A  (y  <  z)  D  (x  <  z),  but  does  permit  efficient  computation  of  T- 
resolvents  (even  allowing  the  possibility  of  compiling  T  to  LISP  code  and  thence  to  machine 
code).  Z-factoring  and  Z-subsumption  operations  are  also  defined. 

Harrison  and  Rubin’s  U-generallzed  resolution  [15]  is  essentially  binary  partial  nar¬ 
row  theory  resolution  applied  to  seta  of  clauses  that  have  a  unit  or  input  refutation.  They 
apply  it  to  building  in  the  equality  relation,  developing  a  procedure  similar  to  Morris’s  E- 
resolution  [21].  The  restriction  to  sets  of  clauses  having  unit  or  input  refutations  eliminates 
the  need  for  factoring  and  simplifies  the  procedure,  but  otherwise  limits  its  applicability. 
No  effort  was  made  in  the  definition  of  U-generalized  resolution  to  limit  the  applicability  of 
T-resolution  to  reasonable  cases  (e.g.,  formation  of  an  Of?D-resolvent  of  a  <  b  and  c  <  d  is 
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permitted  by  the  definition). 

The  linked  inference  principle  by  Wos  et  al.  [40]  is  related  to  theory  resolution 
in  concept  and  purpose.  The  linked  inference  principle  is  a  somewhat  more  conservative 
extension  of  resolution  than  theory  resolution,  since  it  stipulates  that  the  theory  will  be  built 
in  by  means  of  clauses  designated  as  linking  clauses.  Theory  resolution,  on  the  other  hand, 
allows  the  theory  to  be  incorporated  as  a  “black  box”  that  determines  T-unsatisfiability 
questions  in  an  unspecified  manner.  This  facilitates  the  use  of  other  systems,  which  do 
not  rely  upon  resolution  or  clause  representation,  to  build  in  theories.  Nevertheless,  many 
instances  of  theory  resolution  can  be  usefully  implemented  in  the  manner  of  the  linked 
inference  principle.  Since  the  implementation  proposal  for  the  linked  inference  principle 
is  more  concrete,  Wos  et  al.  have  expended  comparatively  more  effort  in  determining  how 
inference  using  the  linked  inference  principle  is  to  be  controlled,  including  defining  linked 
variants  of  resolution  refinements  such  as  unit-resulting  resolution  and  hyperresolution. 

Paramodulatlon  [39]  can  be  viewed  as  an  instance  of  binary  partial  theory  resolution 
where  T  is  the  theory  of  equality  EQ.  P(b)  is  an  EQ-resolvent  (paramodulant)  of  P(a)  and 
o  =  6.  Digricoli’s  resolution  by  unification  and  equality  [12]  can  be  viewed  similarly. 
a  ^  6  is  an  EQ-resolvent  (RU E-resolvent)  of  P(a)  and  -iP(6).  Morris's  E-resolutlon 
[21]  can  be  viewed  as  an  instance  of  total  theory  resolution,  n  is  a  total  EQ-resolvent 
(E-resolvent)  of  P(a),  ->P( 6),  and  a  =  b. 

The  difference  between  these  equality  reasoning  procedures,  from  the  standpoint  of 
theory  resolution,  is  the  selection  of  the  key  set  of  literals  for  theory  resolution  operations. 
The  minimally  EQ-unsatisfiable  set  of  literals  P(a),  ->F(6),  and  a  =  6  was  used  for  all  three 
examples.  In  the  paramodulation  example,  P(a)  and  a  =  6  were  used  as  the  key  and  P(b) 
was  derived  as  the  residue.  In  the  resolution  by  unification  and  equality  example,  P(a) 
and  ->P{b)  were  used  as  the  key  and  o  b  was  derived  as  the  residue.  In  the  E-resolution 
example,  the  key  set  consisted  of  all  of  P(a),  -iP(6),  and  a  =  b,  and  the  residue  was  empty. 

Where  T  consists  of  ordering  axioms,  including  axioms  that  show  how  ordering  is  pre¬ 
served  (such  as  (x  <  y)  D  (Pos(x)  D  Pos(y))  and  (x  <  y)  D  (z  +  z  <  y  +  z)),  T-resolution 
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operations  include  Manna  and  Waldinger’s  relation  replacement  rule  (e.g.,  Pos(b)  can 
be  inferred  from  Poa(a)  and  (o  <  6)  and  relation  matching  rule  [20]  (e.g.,  ->(a  <  6) 
can  be  inferred  from  Poa(a)  and  ->Pos(b)),  which  are  extensions  of  paramodulation  and 
resolution  by  unification  and  equality  respectively.  T-resolution  with  ordering  axioms  is 
also  similar  to  Slagle  and  Norton’s  reasoning  with  partial  ordering  inference  rules  [30]. 
Bledsoe  and  Hines’s  variable  elimination  and  chaining  [7]  is  a  refined  method  for  rea¬ 
soning  about  inequalities  that  can  be  viewed  partly  as  theory  resolution  for  inequality  with 
added  constraints  on  theory  resolution  operations.  The  0/2Z)-resolvent  a  <  c  of  a  <  b 
and  b  <  c  is  a  variable-elimination-procedure  chain  resolvent  only  if  b  is  a  shielding  term 
(nonground  term  headed  by  an  uninterpreted  function  symbol).  The  variable-elimination 
rule  allows  inferring  O/ZD-resolvent  (a  <  6)  V  C  from  clause  (a  <  x)  V  (a:  <  b)  V  C  only  if  x 
does  not  occur  in  o,  b,  or  C,  It  more  generally  allows  replacement  of  multiple  literals  a,-  <  x 
and  x  <  bj  in  a  clause  by  literals  a,-  <  bj.  This  result  is  obtainable  by  theory  resolution 
if  we  include  the  axiom  -<(x  <  min{x,y))  and  a  rule  to  transform  min(a,-,,  o,-3)  <  bj  to 
(a;,  <  by)  V  (a,3  <  bj). 

Allen’s  interval-based  temporal  logic  [1]  is  another  prospective  system  for  explana¬ 
tion  and  implementation  by  theory  resolution.  Allen  enumerates  the  13  mutually  exclusive 
relations  that  can  hold  between  any  pair  of  intervals:  before,  after,  during,  contains,  over¬ 
laps,  overlapped-by,  meets,  met-by,  starts,  started-by,  finishes,  finished-by,  and  equal.  For 
each  pair  of  temporal  relations  ry  and  r2  and  intervals  A,  B,  and  C  such  that  rj(A,  B)  and 
ri(B,C),  interval-based  temporal  logic  defines  what  temporal  relations  are  possible  between 
A  and  C.  For  example,  if  A  meets  B  and  B  is  during  C,  then  either  A  overlaps  C,  A  is 
during  C,  or  A  starts  C.  This  corresponds  to  the  theory  resolution  operation  of  resolving 
during(A,  B)  and  ovcrlaps(B,  C )  to  obtain  during{A,  C)  V  overlaps(A,  C)  V  starts(A,  C). 

It  is  useful  to  create  predicates  that  denote  each  of  the  2,s  combinations  of  possible 
temporal  relations  between  a  pair  of  intervals  so  that  uncertainty  about  which  relation 
holds  will  be  represented  by  a  predicate  rather  than  a  disjunction.  Thus,  during(A,  B) 
and  overlaps(B,C)  would  be  resolved  to  obtain  [during\overlaps\starts)(A,  C).  With  this 
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approach,  it  is  possible  to  relate  all  the  possible  relations  between  A  and  C  at  once  with 
other  relations  mentioning  A  or  C  by  using  narrow  instead  of  wide  theory  resolution. 

Another  application  of  theory  resolution  is  Konolige’s  resolution  for  modal  logic 
of  belief  [16].  Among  other  things,  he  defines  a  resolution  procedure  that  eliminates 
modal  belief  literals  by  recognizing  their  unsatisfiability  in  a  subordinate  deduction  and 
then  resolving  on  them  in  the  manner  of  theory  resolution.  A  simple  propositional  example 
of  this  is  the  resolution  of  DpVA,  n(p  g)V  B,  and  -iD  qVC  to  obtain  AV  BS/C,  since  the 
conjunction  of  Dp,  m(p  D  q),  and  ->Dq  is  unsatisfiable.  For  reasoning  in  his  modal  logic 
of  belief,  Konolige  envisages  using  a  system  organized  on  principles  similar  to  metatheory 
resolution. 

We  have  already  suggested  the  importance  of  theory  resolution  for  taxonomic  reason¬ 
ing.  This  is  being  explored  in  the  KRYPTON  knowledge  representation  system  [8,25].  Fig¬ 
ure  1  contains  a  nearly  verbatim  transcription  of  a  proof  using  KRYPTON-style  reasoning.4 
The  problem  is  to  prove  that,  if  Chris  has  no  sons  and  no  daughters,  then  Chris  has  no 
children. 

The  terminological  information  used  in  this  problem  through  theory  resolution  includes 
the  statements  that  boys  are  persons  whose  sex  is  male;  girls  are  persons  whose  sex  is  female; 
“no-sons”  are  persons  all  of  whose  children  are  girls;  “no-daughters”  are  persons  all  of  whose 
children  are  boys.  Relevant  portions  of  this  information  are  included  in  Formulas  1-6,  which 
are  used  to  define  what  theory  resolution  operations  are  possible.  If  complements  of  the 
first  two  atoms  of  each  formula  can  be  found,  they  can  be  resolved  upon,  and  the  remaining 
part  of  the  formula,  if  any,  would  be  derived  as  the  residue.  Thus,  Formula  1  expresses 
the  unsatisfiability  of  Boy(John)  and  ->Person(J ohn).  Formula  5  permits  the  derivation 
of  Girl(Sandy)  from  NoSon[Mary)  and  Child[M  ary,  Sandy).  These  formulas  behave 
similarly  to  linking  clauses  in  linked  inference  [40]. 

'‘This  proof  was  done  by  the  KLAUS  deduction  system[14,31]  rather  than  KRYPTON  however.  Pigman  [25] 
presents  an  actual  KRYPTON  proof  of  this  problem,  but  it  differs  from  this  one,  since  different  decisions 
were  made  as  to  what  theory  resolution  operations  to  perform  using  the  same  terminological  information. 
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2-ary  rule 

1. 

Boy{x)  D  Person(x) 

2-ary  rule 

2. 

[Boy(x)  A  Sex(x,  y)]  D  Male(y) 

2-ary  rule  (not  used) 

3. 

Girl(x)  D  Person(x) 

2-ary  rule 

4. 

[C?ir/(:r)  A  Sex(x,  y)]  D  Female(y) 

2-ary  rule 

5. 

[yVoS'on(i)  A  Child(x,  y)]  D  Girl(y ) 

2-ary  rule 

6. 

[NoDaughter(x)  A  Child(x,y)]  D  Boy(y) 

7. 

Person(x)  D  Sex(x,  s/fcl(x)) 

8. 

Male(x)  =  -i  Female(x) 

9. 

NoSon(Chris) 

10. 

NoDaughter(Chris ) 

negated  conclusion 

11. 

Child(Chrts,  sk2) 

resolve  11  and  9  using  5 

12. 

Gtrl(sk2) 

resolve  11  and  10  using  6 

13. 

Boy{sk2) 

resolve  13  and  7  using  1 

14. 

Sex(sk2,  skl(sk2)) 

resolve  13  and  14  using  2 

15. 

Male(skl(sk2)) 

resolve  12  and  14  using  4 

16. 

F  emale{skl[sk2)) 

resolve  16  and  8,  simplify  by  15 

i  20. 

□ 

Figure  1:  KRYPTON-style  Proof 


The  assertional  information  used  in  this  problem  includes  the  information  that  every 
person  has  a  sex;  males  and  females  are  disjoint;  Chris  has  no  sons  and  no  daughters.  From 
these  facts,  and  the  built  in  terminological  information,  a  refutation  is  completed  starting 
with  the  negation  of  the  desired  conclusion  that  Chris  has  no  children,  ski  and  sk2  are 
Skolem  functions. 

The  following  table  compares  the  statistics  for  proofs  completed  with  and  without  For¬ 
mulas  1-6  built  in  through  theory  resolution.  The  proof  strategies  used  and  meaning  of  the 
statistics  are  essentially  the  same  as  described  in  Section  4. 


Built  in 

Inputted 

Derived 

Retained 

Successful 

Time 

Proof 

Axioms 

Formulas 

Formulas 

Formulas 

Unifications 

(seconds) 

Length 

none 

11 

10 

20 

33 

1.0 

9 

1-6 

5 

9 

11 

24 

6 

There  is  a  noticeable  improvement  resulting  from  using  theory  resolution,  but  because 
the  problem  is  so  small,  the  difference  is  not  large.  Harder  problems  (like  the  one  in 
Section  4)  can  be  used  to  demonstrate  much  greater  improvement. 

Theory  resolution  for  taxonomic  reasoning  also  incorporates  many  elements  of  reasoning 
in  a  many-sorted  logic.  For  example,  in  Walther’s  ERP-calculus -{many-sorted  resolution 
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and  paramodulation)  [33,35],  sort  declarations,  subsort  relationships,  and  sort  restrictions 
on  clauses  are  all  incorporated  into  the  unification  procedure,  and  eliminated  from  the 
clauses  in  the  statement  of  a  problem.  Thus,  the  ERP  unification  procedure  implements  a 
theory  of  sort  information. 


4  Experimental  Results 


Although  the  relationship  of  theory  resolution  to  many  other  extensions  of  resolution  (as 
discussed  in  the  preceding  section)  and  experience  with  numerous  small  examples  support 
the  practical  value  of  theory  resolution,  we  will  not  elaborate  on  these,  but  will  rather 
bolster  our  claim  with  an  examination  of  experimental  results  for  “Schubert’s  Steamroller” 
challenge  problem. 

Schubert’s  steamroller  problem  is 

Wolves,  foxes,  birds,  caterpillars,  and  snails  are  animals,  and  there  are  some  of 
each  of  them.  Also  there  are  some  grains,  and  grains  are  plants.  Every  animal 
either  likes  to  eat  all  plants  or  all  animals  much  smaller  than  itself  that  like  to 
eat  some  plants.  Caterpillars  and  snails  are  much  smaller  than  birds,  which  are 
much  smaller  than  foxes,  which  in  turn  are  much  smaller  than  wolves.  Wolves 
do  not  like  to  eat  foxes  or  grains,  while  birds  like  to  eat  caterpillars  but  not 
snails.  Caterpillars  and  snails  like  to  eat  eome  plants.  Therefore  there  is  an 
animal  that  likes  to  eat  a  grain-eating  animal. 

In  reporting  a  solution  to  this  problem,  Walther  [34]  states,  “this  problem  became  well 
known,  since,  in  spite  of  its  apparent  simplicity,  it  turned  out  to  be  too  hard  for  existing 
theorem  provers  because  the  search  space  is  just  too  big.”  We  will  discuss  his  and  other 
solutions  at  the  end  of  this  section. 

An  English-language  solution  to  the  problem  is 

Every  animal  either  likes  to  eat  all  plants  or  all  animals  much  smaller  than  itself 
that  like  to  eat  some  plants.  Snails  are  much  smaller  than  birds  and  like  to  eat 
some  plants.  Because  birds  do  not  like  to  eat  snails,  it  must  be  the  case  that 
birds  like  to  eat  all  plants,  including  grains.  Wolves  do  not  like  to  eat  grain. 
Therefore  they  must  like  to  eat  all  animals  much  smaller  than  themselves  that 
like  to  eat  some  plants.  Because  foxes  are  much  smaller  than  wolves  and  wolves 
do  not  like  to  eat  foxes,  it  must  be  the  case  that  foxes  do  not  like  to  eat  plants. 
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1.  Wolf(x)  3  Animal(x) 

2.  Fox(x)  3  Animal{x) 

3.  Bird[x )  3  Animal(x) 

4.  Caterpillar (x)  3  Animal{x) 

5.  Snail(x)  3  Animal(x) 

6.  Gratn(x)  3  Plant(x) 

7.  Wolf(a-wolf) 

8.  Fox(a-fox) 

9.  Bird(a-bird) 

10.  Caterpillar(a-cater  pillar) 

11.  Snail(a-snail) 

12.  Grain(a-grain) 

13.  Animal[x)  3  [(P/tmt(y)  3  Likea-to-tat(x,y )] 

V[\Animal(z)  A  Much- smaller (z,x)  A  Plant(w)  A  Likcs-to-eat(z,w)] 

3  Likcs-to-eat(x,  z)]\ 

14.  [[f?ird(y)  A  [5nai7(x)  V  Caterpillar  (x)]]  V  [Bird(x)  A  Fox(y)] 

V[Fox(x)  A  WoZ/fy)]]  3  Mvch-smaller{x,y) 

15.  [(IVoZ f(x)  A  [Fox(yj  V  Gra«n(y)]]  V  [Bird(x)  A  Sna«7(y)]]  3  -'LiJtes-lo-cal(x,y) 

16.  [Bird(x)  A  Ga<erpi7/ar(y)]  3  Likes-to-eat\x,  y) 

17.  [GaterpiZZar(x)  V  Sna»7(x)]  3  [F/anl(ail(x))  A  Li£ea-to-eat(x,sil(x))] 

18.  -iAnimal(x)  V  -lAmmaZfy)  V  -iGrain(z)  V  ->Likcs-to-cat[y,  z)  V  -iLikes-to-eat(x,  y) 


Figure  2:  Steamroller  Axioms 

Therefore  foxes  like  to  eat  birds  since  birds  are  grain-eating  animals  that  are 
much  smaller  than  foxes. 

Our  formulation  of  Schubert’s  steamroller  problem  appears  in  Figure  2.  The  objective 
is  to  refute  Formula  18  by  Formulas  1-17.  We  UBed  a-tuolf,  a- fox,  etc.,  in  Formulas  7-12 
as  the  Skolem  constants  introduced  by  the  assertions  3 x.Wolf(x),  3 x.Fox(x),  etc.  ski  in 
Formula  17  is  a  Skolem  function. 

We  present  statistics  on  several  solutions  of  Schubert’s  steamroller  problem  found  by 
our  theorem  prover  [31],  The  first  is  a  proof  that  does  not  use  theory  resolution;  the  second 
is  a  proof  using  theory  resolution  to  implement  the  taxonomic  information  in  the  problem 
(Formulas  1-6);  the  remaining  proofs  show  the  results  of  using  theory  resolution  to  build 
in  each  of  Formulas  14-17  successively. 

The  same  strategy  was  used  for  all  of  the  proofs.  Nonclausal  connection-graph  reso¬ 
lution  was  the  principal  inference  rule.  Factoring  was  not  employed.  Pure,  variant,  and 
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tautojogous  formulas  were  eliminated. 

Single  literal  formulas  were  used  for  both  forward  and  backward  demodulation.  In  a 
technical  sense,  however,  this  is  not  demodulation — which  is  really  a  rule  for  simplification 
by  equalities — but  it  performs  a  similar  function.  For  example,  if  A  is  an  atomic  assertion, 
it  could  be  used  to  simplify  formulas  containing  atom  A1,  an  instance  of  A,  by  replacing 
A'  by  true  in  the  formula.  Similarly,  ->A  could  be  used  to  replace  instances  A'  of  A  by 
false.  These  could  be  construed  as  demodulation  by  the  equivalences  A  =  true  and  A  = 
false,  respectively.  Depending  on  the  polarity  of  the  atoms,  the  effect  is  essentially  unit 
subsumption  by  the  single  literal  formula  or  unit  resolution  with  it. 

Heuristic  search,  guided  by  a  simple  weighted  function  of  the  deduction  level  of  the 
parents  and  the  expected  size  of  the  resolvent,  was  used  to  decide  which  inference  operation 
should  be  performed  next.  The  set  of  support  strategy  (with  only  Formula  18  supported) 
and  an  ordering  strategy  that  designated  which  atoms  in  a  formula  could  be  resolved  upon 
were  used  to  limit  the  number  of  alternative  inference  operations. 

In  using  theory  resolution,  connection  graph  links  were  created  from  key  sets  of  literals 
in  the  theory  being  incorporated.  Formulas  1-6  and  17  were  implemented  by  binary  total 
narrow  theory  resolution  links  and  Formulas  14-16  were  implemented  by  3-ary  total  nar¬ 
row  theory  resolution  links.  For  example,  Wolf(t)  and  -i Animal(t)  could  be  linked,  and 
Bird(ti),  Snail(ti),  and  Likes-to-eai{t\,  t-f)  could  all  be  linked.  Theory  resolution  was  also 
used  in  demodulation — e.g.,  W ol f{t)  could  be  used  to  demodulate  Animal(t)  to  true.  This 
was  accomplished  by  automatically  adding  extra  demodulators  such  as  Animal[t)  =  true 
for  Wolf{t). 

Following  are  the  statistics  for  the  various  solutions  of  Schubert’s  steamroller  problem. 
Included  in  the  statistics  are  the  number  of  formulas  inputted  to  the  theorem  prover,  the 
number  of  formulas  derived  in  the  course  of  searching  for  a  proof,  the  number  of  inputted  and 
derived  formulas  still  present  when  a  proof  was  found,  the  number  of  successful  unification 
attempts  during  the  search  for  a  proof  (including  unification  during  link  inheritance),  the 
time  required  for  the  proof  (on  a  Symbolics  3600  personal  LISP  machine),  and  the  length 
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of  the  proof  in  resolution  steps. 


Built  In 

Inputted 

Derived 

Retained 

Successful 

Time 

Proof 

Axioms 

Formulas 

Formulas 

Formulas 

Unifications 

(h:mm) 

Length 

none 

18 

2,717 

595 

216,987 

2:53 

59 

1-6 

12 

889 

246 

44,928 

37 

1-6, 14 

11 

68 

0:01.3 

32 

1-6,  14-15 

10 

63 

4,555 

0:01.1 

32 

1-6,  14-16 

9 

212 

57 

32 

1-6,  14-17 

8 

262 

24 

7,711 

0:01.6 

24 

MKRP 

27 

60 

83 

0:04.4 

55 

MKRP  ERP 

12 

10 

13 

00 

c* 

9 

ITP 

0:06 

Thus,  very  substantial  reductions  in  both  the  size  of  the  search  space  and  the  length  of 
the  proof  resulted  from  building  in  some  of  the  axioms  by  theory  resolution. 

Also  included  in  the  table  are  statistics  we  know  for  solutions  of  Schubert’s  steamroller 
problem  by  other  systems. 

The  MKRP  solution  was  done  by  Walther  [36]  using  the  Markgraf  Karl  Refutation 
Procedure  [6].  This  proof  relied  heavily  on  the  MKRP  TERMINATOR  module  [4],  which  is 
essentially  a  very  fast  procedure  for  finding  unit  refutations.  A  superior  proof  by  Walther 
[34]  used  his  ERP  calculus  [33,35]  in  the  MKRP  system  to  perform  many-sorted  resolution 
on  a  much  reduced  set  of  clauses.  This  proof  also  used  the  TERMINATOR  module,  but, 
given  the  reduction  in  the  number  of  clauses  and  literals  made  possible  by  using  many- 
sorted  resolution  and  its  restrictions  on  unification,  here  its  use  was  not  essential  to  finding 
a  solution  with  reasonable  effort.  MKRP  is  written  in  INTERLISP  and  was  run  on  a  Siemens 
7760  computer. 

Our  first  theory  resolution  proof,  in  which  only  the  taxonomic  information  of  Formu¬ 
las  1-6  is  incorporated,  has  some  similarity  to  a  many-sorted  resolution  proof.  In  the  MKRP 
ERP  proof,  Wolf ,  Fox,  Bird,  Caterpillar,  and  Snail  were  declared  to  be  subsorts  of  sort  An¬ 
imal  and  Grain  was  declared  to  be  a  subsort  of  sort  Plant.  The  unification  algorithm  was 

restricted  so  that  a  variable  can  be  unified  with  a  term  if  and  only  if  the  term  is  a  subsort 
6This  figure  is  for  a  solution  without  factoring  (like  ours)  and  excludes  unifications  performed  inside  the 
Terminator  module. 
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of  or  equals  the  sort  of  the  variable.  For  building  in  just  this  taxonomic  information,  many- 
sorted  resolution  is  stronger  than  this  particular  instance  of  theory  resolution.  Although 
theory  resolution  handies  the  sort  literals  more  effectively  than  ordinary  resolution,  many- 
sorted  resolution  dispenses  with  them  entirely.  Also,  many-sorted  resolution  is  used  to  build 
in  the  sort  information  for  Skolem  constants  and  functions  so  that,  in  Schubert’s  steamroller 
problem,  Formulas  7-12  are  supplanted  by  type  declarations.  This  would  have  the  effect  of 
eliminating  the  formation  of  atoms  like  Wolf  {a- fox)  (if  the  predicate  Wolf  were  used  at 
all).  In  somewhat  similar  fashion,  we  have  been  successful  in  substantially  decreasing  the 
effort  required  to  solve  Schubert’s  steamroller  problem  by  adding  sort  disjointness  informa¬ 
tion  through  theory  resolution,  so  that,  for  example,  Wolf  (a-  fox)  is  evaluated  to  be  false. 
However,  this  is  really  an  extension  of  Schubert’s  steamroller  problem,  and  it  is  conceivable 
for  the  disjointness  information  to  contribute  to  an  invalid  solution  of  the  problem,  instead 
of  it  just  being  used  to  reduce  the  size  of  the  search  space. 

The  ITP  solution  was  found  by  the  automated  reasoning  system  ITP  (written  in  PAS¬ 
CAL)  developed  at  Argonne  National  Laboratory  [19].  This  solution  used  qualified  hyper- 
resolution  [18,37]  and  was  completed  in  about  six  minutes  on  a  VAX  11/780  computer  [24]. 
Like  the  theory  resolution  and  MKRP  ERP  solutions,  this  solution  treated  the  taxonomic 
sort  information  in  the  problem  specially.  In  qualified  hyperresolution,  some  literals  in  a 
clause  can  be  designated  as  qualifier  literals  that  contain  “conditions  of  definition”  for  terms 
appearing  in  the  clause.  Qualifier  literals  are  ignored  during  much  of  the  inference  process — 
e.g.,  a  clause  consisting  of  a  single  nonqualifier  literal  and  some  qualifier  literals  is  handled 
as  if  it  were  a  unit  clause — with  the  conditions  imposed  by  the  qualifier  literals  checked 
only  after  the  qualified  terms  are  instantiated.  Thus,  sort  restrictions  can  be  specified  in 
qualifier  literals  and  deductions  can  be  performed  using  only  the  nonsort  information.  The 
deductions  are  then  subjected  to  verification  that  terms  are  of  the  correct  sort. 
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5  Conclusion 


Theory  resolution  is  a  set  of  complete  procedures  for  incorporating  decision  procedures  into 
resolution  theorem  proving  in  first-order  predicate  calculus.  Theory  resolution  can  greatly 
decrease  the  length  of  proofs  and  the  size  of  the  search  space.  Total  theory  resolution  can 
be  used  when  there  exists  a  decision  procedure  for  the  theory  that  is  capable  of  determining 
unsatisfiability  of  any  set  of  clauses  using  predicates  of  the  theory.  This  may  be  a  realistic 
requirement  in  some  mathematical  theorem  proving. 

Partial  theory  resolution  requires  much  less  of  the  decision  procedure.  It  requires  only 
that  conditions  for  unsatisfiability  of  sets  of  literals  be  determinable  by  the  decision  pro¬ 
cedure  for  the  theory.  This  makes  it  feasible,  for  example,  to  consider  use  of  a  knowledge 
representation  system  as  the  decision  procedure  for  taxonomic  information. 

Theory  resolution  is  also  a  generalization  of  several  other  approaches  to  building  in 
nonequational  theories. 

We  are  implementing  and  testing  forms  of  theory  resolution  in  the  deduction-system 
component  of  the  KLAUS  natural-language-understanding  system  [14,31].  This  system 
demonstrated  substantial  improvement  in  performance  when  theory  resolution  was  used  on 
Schubert’s  steamroller  challenge  problem.  The  KRYPTON  knowledge  representation  system 
[8,25]  is  also  applying  the  ideas  of  theory  resolution  to  combine  a  terminological  reasoning 
component  and  an  assertional  reasoning  component  (for  which  they  are  also  utilizing  the 
KLAUS  deduction  system). 

Theory  resolution  is  a  procedure  with  substantial  power  and  generality.  It  is  our  hope 
that  it  will  serve  as  a  base  for  the  theoretical  and  practical  development  of  a  methodology 
for  combining  the  general  reasoning  capabilities  of  resolution  theorem-proving  programs 
with  more  efficient  specialized  reasoning  procedures. 

One  important  area  for  further  research  on  theory  resolution  is  finding  restrictions  on 
the  need  for  retention  of  tautologies  and  determining  compatibility  with  other  resolution 
refinements. 

Another  important  research  question  is  handling  combinations  of  theories  (beyond  the 
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trivial  case  of  totally  disjoint  theories).  Successful  combining  of  multiple  deductive  spe¬ 
cialists  within  a  resolution  framework  awaits  further  development  in  this  area.  The  work 
of  Nelson  and  Oppen  [22]  and  Shostak  [28]  on  combining  quantifier  free  theories  may  be 
relevant. 
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